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Abstract — This paper considers tlie multiantenna broadcast 
cliannel witiiout transmit-side ciiannel state information (CSIT). 
For tliis cliannel, it lias been Itnown tliat wlien all receivers have 
channel state information (CSIR), the degrees of freedom (DoF) 
cannot be improved beyond what is available via TDMA. The 
same is true if none of the receivers possess CSIR. This paper 
shows that an entirely new scenario emerges when receivers 
have unequal CSIR. In particular, orthogonal transmission is 
no longer DoF-optimal when one receiver has CSIR and the 
other does not. A multiplicative superposition is proposed for 
this scenario and shown to attain the optimal degrees of freedom 
under a wide set of antenna configurations and coherence 
lengths. Two signaling schemes are constructed based on the 
multiplicative superposition. In the first method, the messages of 
the two receivers are carried in the row and column spaces of a 
matrix, respectively. This method works better than orthogonal 
transmission while reception at each receiver is still interference- 
free. The second method uses coherent signaling for the receiver 
with CSIR, and Grassmannian signaling for the receiver without 
CSIR. This second method requires interference cancellation at 
the receiver with CSIR, but achieves higher DoF than the first 
method. 



I. Introduction 

In the MIMO broadcast channel, when channel state infor- 
mation is available at the receiver (CSIR) but not at the trans- 
mitter (CSIT), orthogonal transmission (e.g., TDMA) achieves 
optimal degrees of freedom (DoF) [1], [2]. With neither CSIT 
nor CSIR, again orthogonal transmission achieves the best 
possible DoF [3]. This paper studies the broadcast channel 
where one receiver has full CSIR and another has no CSIR. In 
this case, new DoF gains are discovered that can be unlocked 
with novel signaling strategies. 

The study of broadcast channels with unequal CSIR is 
motivated by downlink scenarios where users have different 
mobilities. Subject to a fixed downlink pilot transmission 
schedule, the low-mobility users have the opportunity to 
reliably estimate their channels, while the high-mobility users 
may not have the same opportunity. 

The main result of this paper is that when one receiver has 
full CSIR and the other has none, the achieved DoF is strictly 
better than that obtained by orthogonal transmission. For the 
unequal CSIR scenario, we propose a product superposition, 
where the signals of the two receivers are multiplied to produce 
the broadcast signal. In the following the receiver with full 
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CSIR is referred to as the static receiver and the receiver 
with no CSIR as the dynamic receiver. Two classes of product 
superposition signaling are proposed: 

• In the first method, information for both receivers is 
conveyed by the row and column spaces of a transmit 
signal matrix, respectively. The signal matrix is con- 
structed from a product of two signals that lie on different 
Grassmannians. The two receivers do not interfere with 
each other even though there is no CSIT, a main point of 
departure from traditional superposition broadcasting [1], 
[4]. 

• In the second method, information for the static receiver 
is carried by the signal matrix values (coherent signaling), 
while information for the dynamic receiver is transported 
on the Grassmannian. The static receiver is required to 
decode and cancel interference, therefore this method 
is slightly more involved, but it achieves higher DoF 
compared with the first method. 

Using the proposed methods, the exact DoF region is found 
when Ni < N2 < M, T > 2Ni, where iVi, N2 and M 
are the number of antennas at the dynamic receiver, static 
receiver and transmitter, respectively, and T is the channel 
coherence time of the dynamic receiver For N2 < Ni < 
M, T > 2Ni, we partially characterize the DoF region when 
either the channel is the more capable type [5], or when the 
message set is degraded [6]. 

We use the following notation throughout the paper: for a 
matrix A, the transpose is denoted with A*, the conjugate 
transpose with A^, and the element in row i and column j 
with [A]ij. The k x k identity matrix is denoted with Ik- The 
set of n X m complex matrices is denoted with C"^™. 

The organization of this paper is as follows. In Section |ll] 
we introduce the system model and preliminary results. Two 
signaling methods are proposed and studied in Section Hill and 
Section |IV] respectively. 

II. System Model and Preliminaries 

We consider a broadcast channel with an M-antenna trans- 
mitter and two receivers. One receiver has access to channel 
state information (CSI), and is referred to as the static receiver 
The other receiver has no CSI, e.g. due to mobility, and is 
referred to as the dynamic receiver. The dynamic receiver 
has iVi antennas and the static receiver has N2 antennas. 
Denote the channel coefficient matrices from the transmitter 
to the dynamic and static receivers by Hi e (jA^ixa/ ^jj^j 
H2 e C^^''*^, respectively. We assume that Hi is constant for 
T symbols (block-fading) and is unknown to both receivers. 



2 



Dynamic Rx 




Static Rx 



Fig. 1. Channel model. 



while H2 is known by the static receiver but not known by 
the dynamic receiver!^ Neither Hi nor H2 is known by the 
transmitter (no CSIT). 

Over T time-slots (symbols) the transmitter sends X = 
[xi,-- - ,XA/]* across AI antennas, where G C'^^^ is the 
signal vector sent by the antenna i. The normalized signal at 
the dynamic and static receivers is respectively 

1 



Yi -HiX+— Wi, 

Y2 = H2X+^W2, 



(1) 



where Wi e C^i""^ and W2 G C^^xt ^^.^ additive noise 
with i.i.d. entries CA/'(0, 1). Each row of Yi G C^i>^t (^j. 
Y2 G C^2xT-j corresponds to the received signal at an antenna 
of the dynamic receiver (or the static receiver) over T time- 
slots. The transmitter is assumed to have an average power 
constraint p, and therefore, in the normalized channel model 
given by ([T]i, the average power constraint is: 



M 



E[^tr(x,xt)] =T. 



(2) 



i=l 



The channel Hi has i.i.d. entries with zero mean and unit 
variance, but we do not assign any specific distribution for 
Hi. This general model includes Rayleigh fading as a special 
case where the entries of Hi are i.i.d. CAf{0, 1). The channel 
H2 is assumed to have full rank; this assumption, e.g., holds 
with probability 1 if the entries of H2 are drawn independently 
according to a continuous distribution. We focus on the case 
of M = max(Afi,7V2) and T > 2Ni, which is motivated by 
the fact that having more transmit antennas does not increase 
the multiplexing gain for either receiver, and the fact that if 
T < 2Ni, some of the antennas of the dynamic receiver can 
be deactivated without any loss in the degrees of freedom 
(DoF) [7]. 

The degrees of freedom at receiver i is defined as: 

di = Imi , 

p^oo logp 

where Ri (p) is the rate of the dynamic receiver for i = 1 and 
the static receiver for i — 2, respectively. 

'in practice H2 for a static receiver may vary across intervals of length 
much greater than T. However, for the purposes of this paper, once H2 
is assumed to be known to the static receiver, its time vaiiation (or lack 
thereof) does not play any role in the subsequent mathematical developments. 
Therefore in the interest of elegance and for a minimal description of the 
requirements for the results, we only state that H2 is known. 



A. Definitions 

Definition 1 (Isotropically Distributed Matrix [8]): A ran- 
dom matrix X G (j*:xn^ where n > k, is called isotropically 
distributed (i.d.) if its distribution is invariant under unitary 
transformations, i.e., for any deterministic nxn unitary matrix 

p(X)=p(X*). (3) 
An example of i.d. matrices is X with i.i.d. CAf{0, 1) entries. 

Remark 1: An interesting property of i.d. matrices is that if 
X is i.d. and is a random unitary matrix that is independent 
of X, then X$ is independent of $ [7, Lemma 4]. That is, 
any rotation to an i.d. matrix is essentially "invisible." 

Definition 2 (Stief el manifold [9]}: The Stiefel manifold 
F(n, k), where n > fc, is the set of all fc x n unitary matrices, 
i.e., 

F(n,/c) = {Q gC''^" : QQt = 1^}. 
For k — 1, the manifold F(n, 1) is the collection of all n- 
dimensional vectors with unit norm, i.e., the surface of a unit 
ball. 

Definition 3 (Grassmann manifold [9]): The Grassmann 
manifold k), where n > A:, is the set of all fc-dimensional 
subspaces of C". 

Remark 2: The (complex) dimension of (G(n. k) is 



dim (G(rt, fc)) = k{n — fc), 



(4) 



i.e., each point in k) has a neighborhood that is equiva- 
lent (homeomorphic) to a ball in the Euclidean space of com- 
plex dimension k{n~k). The dimensionality of Grassmannian 
can also be viewed as follows. For any matrix Q, there exists 
a k X k full rank matrix U so that 



Q* = UQ 



1 











1 Xk,k+1 



(5) 



where Q and Q* span the same row space. Therefore, each 
point in G(n, k) is determined by k{n—k) complex parameters 



for 1 < j < fc and k 
dimension subspace in C" 
complex variables. 



^ 1 < z < n. In other words, a k- 
is uniquely decided by k[n — k) 



B. Non-coherent Point-to-point Channels 

The analysis in this paper uses insights and results from non- 
coherent communication in point-to-point MIMO channels, 
which are briefly outlined below. 

1) Intuition: Consider a point-to-point M x N MIMO 
channel where the receiver does not know the channel H, 
namely a non-coherent channel. 

At high SNR the additive noise is negligible, so the received 
signal Y w HX, where X is the transmitted signal. Because 
X is multiplied by a random and unknown H, the receiver 
cannot decode X. However, communication is still possible 
because, for any non-singular H, the received signal Y spans 
the same row space as X. Therefore, the row space of X can 
be used to carry information without the need to know H, i.e., 
the codebook consists of matrices with different row spaces. 
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Conveying information via subspaces can be viewed as 
communication on the Grassmann manifold where each dis- 
tinct point in the manifold represents a different subspace [7]. 
In this case, the codewords (information) are represented by 
subspaces, which differs from the coherent communication 
that maps each codeword into one point in a Euclidean 
space [10]. Intuitively, the information of a Grassmannian 
codeword is carried by k{n — k) variables, as seen in 

2) Optimal Signaling: The design of an optimal signaling 
can be viewed as sphere packing over Grassmannians [7]. 
At high SNR, the optimal signals are isotropically distributed 
unitary matrices [8], [7]. In addition, the optimal number of 
transmit antennas depends on the channel coherence time. For 
a short coherence interval, using fewer antennas may lead 
to a higher capacity, and vice versa. The optimal number of 
transmit antennas is 

K ^mm{M,N,[T/2\), (6) 

where T is the channel coherence time, i.e., the number of 
symbols that the channel remains constant. Therefore, the 
optimal signals are K xT unitary matrices. In other words, K 
antennas (K < M) are in use and they transmit equal-energy 
and mutually orthogonal vectors. These unitary matrices reside 
in G{T,K) and each is interpreted as a representation of the 
subspace it spans. This method achieves the maximum DoF 
K{T — K) over T time-slots. Note that the DoF coincides 
with the dimensionality of the Grassmannian G(r, K). 

3) Subspace Decoding: Unlike coherent communication, 
in non-coherent signaling the information is embedded in 
the subspaces instead of the signal values. As long as two 
matrices span the same subspace, they correspond to the 
same message. Maximum-likelihood decoding chooses the 
codeword whose corresponding subspace is the closest one 
to the subspace spanned by the received signal. For example 
in [11], the received signals are projected on the subspaces 
spanned by different codewords, and then the one is chosen 
with the maximum projection energy. More precisely, for the 
transmitted signals S C^^^ from a unitary codebook X, 
and the received signals Y G C^^^, the ML detector is 

±ML = arg max tr{YX|X,Yt}. (7) 

C. A Baseline Scheme: Orthogonal Transmission 

For the purposes of establishing a baseline for comparison, 
we begin by considering a time-sharing (orthogonal transmis- 
sion) that acquires CSIR via training in each interval and uses 
Gaussian signaling. This baseline method has been chosen to 
highlight the differences of the heterogeneous MIMO broad- 
cast channel of this paper with two other known scenarios: 
It is known that for a broadcast channel with no CSIT and 
perfect CSIR, orthogonal transmission ahieves the optimal 
DoF region [2]. Also, a training -based method with Gaussian 
signaling is sufficient to achieve DoF optimality [7] for the 
point-to-point noncoherent MIMO channejl. 

^Grassmannian signaling is superior, but tlie same slope of the rate vs. SNR 
curve is obtained with training and Gaussian signaling in the point-to-point 
MIMO channel. 



In orthogonal transmission, the transmitter communicates 
with the two receivers in a time-sharing manner When trans- 
mitting to the dynamic receiver, it is optimal if the transmitter 
activates only K out of M antennas: it sends pilots from the K 
antennas sequentially over the first K time-slots; the dynamic 
receiver estimates the channel by using, e.g., minimum-mean- 
square-error (MMSE) estimation. Then, the transmitter sends 
data during the remaining (T—K) time-slots, and the dynamic 
receiver decodes the data by using the estimated channel 
coefficients [7], [12]. Using this strategy, the maximum rate 
achieved by the dynamic receiver is: 

)logP + 0(l). (8) 

The operating point in the achievable DoF region where 
the transmitter communicates exclusively with the dynamic 
receiver is denoted with T>i. 

V, = {K{l-^),0). (9) 

For the static receiver the channel is assumed to be known 
at the receiver, therefore data is transmitted to it coherently. 
The maximum rate achieved by the static receiver is [13] 

miii(M,iV2)logp + 0(l). (10) 

The operating point in the DoF region where the transmitter 
communicates only with the static receiver is denoted with 

2?2 - (0, min(Af,iV2)). (11) 

Time-sharing between the two points of Vi and I?2 yields 
the achievable DoF region 

(^tK{l - (1 - t) min(M, N^)^ , (12) 

where t is a time-sharing variable. 

III. Grassmannian Superposition for Broadcast 
Channel 

In this section, we propose a signaling method that attains 
DoF region superior to orthogonal transmission, and allows 
each receiver to decode its message while being oblivious of 
the other receiver's message. 

A. A Toy Example 

Consider M = N2 ^ 2, Ni ^ 1 and T = 2. From 
Section III-CI orthogonal transmission attains 1/2 DoF per 
time-slot for the dynamic receiver and 2 DoF per time-slot for 
the static receiver. By time-sharing between the two receivers, 
the following DoF region is achieved 

2-2t), (13) 

where t E [0, 1] is a time-sharing parameter 

We now consider the transmitter sends a product of signal 
vectors over 2 time-slots 

X = X2X* e (14) 
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where xi = [: 



(1) ^(i)it 



X 



]* and X2 = [j 



(2) A2ht 

1 ^9 



are the signals 



for the dynamic receiver and the static receiver, respectively. 
The vectors xi and X2 have unit-norm and from codebooks 
that he on (G(2, 1). 

The signal at the dynamic receiver is 



yi 



(2)' 
= 1 

n(2) 



1 



„(1) „„(l)l 



VP 



(15) 



where [ft.j"'"'' 
and 



is the isotropically distributed channel vec- 
tor, and h'^^' is the equivalent channel coefficient seen by the 
dynamic receiver 

The subspace spanned by x| is the same as /I'^^^x^, so 
at high SNR the dynamic receiver is able to determine the 
direction specified by x*. From Section III-BI the dynamic 
receiver attains 1/2 DoF per time-slot, which is optimal even 
in the absence of the static receiver 

Consider the signal of the static receiver at time-slot 1: 



y2 = H2 



(2)' 

x\ 




1 


(2)' 

w\ 


(2) 
X2 




"7^ 


(2) 
."^2 . 



(16) 



Because the static receiver knows H2, it can invert the 
channel as long as H2 is non-singular: 



(H2-V2)*=.xl 



(l)r (2) (2). 



r (2) (2)- 



(17) 



The equivalent (unknown) channel seen by the static receiver 
is x^^ , i.e., part of the dynamic receiver's signal. Using 
Grassmannian signaling via the subspace of X2, the DoF 
achieved is again 1/2 per time-slot. 

Time-sharing between the proposed scheme and 2?2 (trans- 
mitting only to the static receiver) yields the achievable DoF 
region 



i-t, 2 
'^2 ' 



2 



(18) 



The above region is strictly larger than that of orthogonal 
transmission, as shown in Figure|2] The static receiver achieves 
1/2 DoF "for free" in the sense that this DoF was extracted 
for the static receiver without reducing the dynamic receiver's 
DoF 



B. Grassmannian Superposition Signaling 

Based on the previous example, we design a general sig- 
naling method (the Grassmannian superposition) with two 
properties: (1) information is carried by subspaces and (2) two 
signal matrices are superimposed multiplicatively so that their 
row (or column) space is unaffected by multiplying the other 
receiver's signal matrix. Two separate cases are considered 
based on whether the number of static receiver antennas is 
larger than the number of dynamic receiver antennas. 

^The noise enhancement induced by cliannel inversion will not affect tlie 
DoF of tlie static receiver. 
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Fig. 2. DoF region of tlie toy example 1. 

7) iVi < N2: The ti-ansmitter sends X e C^^^^ across 
M ^ N2 antennas over an interval of length T: 



X = 



X2X1, 



(19) 



where Xi e c^i^"^ and X2 G C^^^^i are the signals 
for the dynamic receiver and the static receiver, respectively. 
Here, ^JT/Nl is a normalizing factor to satisfy the power 
constraint Information for both receivers are sent over 
the Grassmannian, namely Xi is from a codebook Xi C 
G(T,iVi) and X2 is from a codebook X2 C G(N2,Ni). The 
codebook Xi and X2 are chosen to be isotropically distributed 
unitary matrices (see Section UlI-CI for more details). 

A sketch of the argument for the DoF achieved by the 
Grassmannian superposition is as follows. The noise is neg- 
ligible at high SNR, so the signal at the dynamic receiver is 
approximately 



-H1X2X1 G C 



NixT 



(20) 



The row space of Xi can be determined based on Yi, and 
then (T — Ni)Ni independent variables (DoF) that specify 
the row space are recovered, i.e., the transmitted point Xi in 
Xi e G{T,Ni) is found. 

For the static receiver, since H2 is known by the receiver, 
it inverts the channel (given that H2 is non-singular) 



X2X1 e C 



N2XT 



(21) 



which has approximately the same column space as X2. The 
transmitted point X2 in X2 G G(A^2,^i) will be recovered 
from the column space of H2"^Y2, producing {N2 - iVi)iVi 
DoF 

Therefore, the proposed scheme attains the DoF pair 



N,{l-^),^{N2-N,)j. 



(22) 



The result is more formally stated as follows: 
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Fig. 3. DoF region (Theorem[TJ: A^i = 2, N2 = 4. 



Theorem 1 (Ni < N2): Consider a broadcast channel with 
an M-antenna transmitter, a dynamic receiver and a static 
receiver with iVi and N2 antennas, respectively, with coher- 
ence time T for the dynamic channel. The Grassmannian 
superposition achieves the rate pair 

" i?i -7Vi(l-^)logp + 0(l) 
i?2 = ^(A^2-iVi)logp + 0(l) ■ 

The corresponding DoF pair is denoted 

If we denote the DoF for the single-user operating points for 
the dynamic and static user with Vi, I?2 respectively, the 
achievable DoF region consists of the convex hull of I?2 
and 2?3. 

Proof: See Appendix H] ■ 
From Theorem [T] the static receiver attains a "free" rate of 



A^i 

ARi = -^{N2-N,)\ogp + 0{l). 



(23) 



We plot the achievable DoF region of Theorem [T] in Fig- 
ure 13 For small T, the DoF gain achieved by the proposed 
method is significant, while as T increases, both methods 
approach the coherent upper bound [2] where both of the 
receivers have CSIR. For T — > 00, the rate gain ARi = 0(1), 
and no DoF gain is obtained. In this case, the achievable DoF 
region in Theorem[T]coincides with that attained by orthogonal 
transmission as well as the coherent outer bound [2]. This is 
not surprising, since if the channel remains constant (T 00), 
the resource used for obtaining CSIR is negligible. Finally, the 
rate gain ARi is an increasing function of {N2 — Ni), i.e., 
the extra antennas available for the static receiver. 

Now, we design the dimension of Xi and X2 in (l9[ to 
maximize the achievable DoF region. To find the optimal 
dimensions, we allow the signaling to use a flexible number 
of antennas and time slots, up to the maximum available. Let 
Xi e C^i^"^ and X2 € C'^^y-Ni^ where f<T,Ni<Ni and 
N2 < A^2- Theorem [T] does not immediately reveal the optimal 
values of Ni, N2, and T, because the rates are not monotonic 



in the mentioned parameters. The following corollary presents 
the optimal value of A^i, N2 and T. 

Corollary 1: For the Grassmannian superposition under 
iVi < N2, the signal dimension f = T, Ni = Ni and 
N2 = N2 optimizes the achievable DoF region. 

Proof: See Appendix HH ■ 

Thus, in the special case of A^i < N2, it is optimal to use 
all time slots and all antennas. 

2j iVi > N2: In this case, we shall see that sometimes the 
Grassmanian superposition may still outperform orthogonal 
transmission, but also under certain conditions (e.g. very large 
T or A^i ^ iV2) the Grassmannian superposition as described 
in this section may be not improve the DoF compared with 
orthogonal transmission. 

When A^i > N2, if the Grassmannian signaling to the 
dynamic receiver uses all the Ni dimensions, there will 
remain no room for communication with the static receiver. 
To allow the static user to also use the channel, the dynamic 
user must "back off" from using all the rate available to it, 
in other words, the dimensionality of the signaling for the 
dynamic receiver must be reduced. The largest value of A^i 
that makes A^i < N2 and thus allows nontrivial Grassmannian 
superposition is A^i = iV2 — 1. Once we are in this regime, 
the results of the subsection IIII-B . 1 1 can be used. Specifically, 
Corollary [T] indicates that de-activating any further dynamic 
user antennas will not improve the DoF region. Thus, given 
N2, and assuming we wish to have a non-trivial Grassmannian 
signaling for both users, using A^i = A^2 ^ 1 dimensions for 
signaling to the dynamic receiver maximizes the DoF region. 
The transmit signal is then 



X 



-X2X1, 



(24) 



where Xi e C'^N2-1)xT g ^Ar,x(Ar2-l)^ rpj^g 

sponding achievable DoF pair is 

2?4- ((A^2-l)(l-^^), (A^2-l)/r), (25) 

which leads to the following result. 

Corollary 2 (Ni > N2): Consider an Af -antenna transmit- 
ter broadcasting to a dynamic receiver and a static receiver 
with A^i and N2 antennas, respectively, with coherence time T 
for the dynamic channel. Then the Grassmannian superposition 
achieves the rate pair 



Ri 

i?2 



(A^2-l)(l 



N2 



N2 



log p + 0(1) 



^logp + 0(l) 

Denote the corresponding DoF pair with I?4. Together with the 
two single-user operating points 2?i and 2?2 obtained earlier, 
the achievable DoF region consists of the convex hull of Vi, 
I?2 and I?4. 

Proof: The proof follows directly by replacing A^i with 
(iV2 - 1) in Theorem [U ■ 
In Corollary |2] the DoF for the static receiver has not been 
achieved for "free" but at the expense of reducing the DoF 
for the dynamic receiver The transmitter uses only iV2 — 1 
dimensions for the dynamic receiver, which allows an extra 
DoF (iV2 - 1)/T to be attained at the static receiver If N1-N2 
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Fig. 4. DoF region (Corollary |2j: Ni = N2 = 4. 



and T are small, then the DoF gain of the static receiver 
outweighs the DoF loss for the dynamic, so that the overall 
achievable DoF region will be superior to that of orthogonal 
transmission. In contrast, if A^i N2 or T is large, the DoF 
loss from the dynamic receiver may not be compensated by the 
DoF gain from the static receiver, as illustrated by Figure |4] 
Therefore in the latter case orthogonal transmission may do 
better. The following corollary specifies the condition under 
which Grassmannian superposition improves DoF region com- 
pared with orthogonal transmission. 

Corollary 3: For A^i > N2, the Grassmannian superposi- 
tion improves DoF region with respect to orthogonal trans- 
mission if and only if 



N2 - {N2 - 1)/T 



{N2 - 1)(1 - {N2 ~ 1)/T) ^ 7Vi(l - iVi/T)- ^^^^ 
Proof: The necessary and sufficient condition for ensur- 
ing the improvement of the achievable DoF region is that the 
slope between I?2 and I?4 is larger than the slope between 2?i 
and I?2, which is equivalent to the inequality in the corollary. 



Remark 3: With unitary codebooks Xi and X2, information 
for both receivers is conveyed purely by the the subspace to 
which the codeword belongs. Consider X £ C'^^", n > k, 
which is uniquely represented by ft (the row space of X) and 
a k X k coefficient matrix C according to a certain basis of 
fl. The codewords Xi , X2 can be represented as 



Xi ^ (f)i,Ci), 

X2^(1^2,C2). 

In a manner similar to [7], one can verify 

/(Xi; Yi) = I{n,;Y,) + I{C,;Y,\n,), 



(27) 



(28) 



and 



J(X2;Y2|H2) =/(f^2;Y2|H2) 

+ /(C2;Y2|f^2,H2). 



(29) 



D. Multiplicative vs. Additive Superposition 

In this section, we compare product superposition with ad- 
ditive superposition. Under additive superposition, the transmit 
signal has a general expression 



/Cip Vi Xi + -y/C2P V2 X2, 



(30) 



where Vi and V2 are the precoding matrices, and ci and C2 
represent the power allocation. In this case, the signal at the 
dynamic receiver is 



/cIpHiViXi + ^/c^HiV2X2 + Wi. 



(31) 



Since Hi is unknown, the second interference term can- 
not be completely eliminated in general, which leads to a 
bounded signal-to-interference-plus-noise ratio (SINR), result- 
ing in zero DoF for the dynamic receiver 

For the multiplicative superposition, the signal at the dy- 
namic receiver is 



C. Design of Xi and X2 

The representation of a point in the Grassmannian is not 
unique [14] (also see Remark|2]i, and therefore the codebooks 
Xi C G(r, iVi) and X2 C G(7V2, iVi) are not unique. 

First, X2 is chosen to be a unitary codebook. When X2 is 
unitary, for i.i.d. Rayleigh fading Hi, the equivalent dynamic 
channel Hi = H1X2 still has i.i.d. Rayleigh fading coeffi- 
cients [8]. Therefore, the static receiver is transparent to the 
dynamic receiver, which allows us to decouple and simplify 
the design of the two codebooks and their decoders. 

Once X2 is chosen to be a set of unitary matrices, com- 
munication between dynamic receiver and the transmitter is 
equivalent to a non-coherent point-to-point MIMO channel. 
Hence, to maximize the rate of the dynamic receiver at high 
SNR, Xi must also be a collection of isotropically distributed 
unitary matrices (see Section HHi. 



^HiX2Xi + Wi 
^HiXi +Wi, 



(32) 



where c is a power normalizing constant. For any unitary 
X2, X2X1 and Xi span the same row space. This invariant 
property of Grassmannian enables us to convey information 
to the static receiver via X2 without reducing the degrees 
of freedom of the dynamic receiver. Intuitively, the dynamic 
receiver does not have CSIR and is "insensitive" to rotation, 
i.e., the distribution of Yi does not depend on X2. 
For the static receiver, the received signal is 



^H2X2Xi+W2. 



(33) 



Because H2 is known, the channel rotation X2 is detectable, 
i.e., the distribution of Y2 depends on X2. Therefore X2 can 
be used to convey information for the static receiver. 
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IV. Grassmannian-Euclidean Superposition for 
THE Broadcast Channel 

We now propose a new transmission scheme based on 
successive interference cancellation, where the static receiver 
decodes and removes the signal for the dynamic receiver 
before decoding its own signal. This scheme improves the 
DoF region compared to the non-interfering Grassmannian 
signaling of the previous section. 



A. A Toy Example 

Consider M = A^i = iV2 = 1 and T = 2. Our approach is 
that over 2 time-slots, the transmitter sends 



1x2 



(34) 



where xi = [a;-j^ ] is the signal for the dynamic receiver 
and X2 is the signal for the static receiver Here, xi has unit- 
norm and is from a codebook Xi that is a subset of G(2, 1), 
and X2 can obey any distribution that satisfies the average 
power constraint. 

The signal at the dynamic receiver is 

1 



yi = hiX2[xl 



(1) ^(l)l 



„(1) „„(l)l 



^h, [4^) X^2^] + —[ 



1 



„(1) ,„(l)l 



(35) 



(36) 



where /ii is the channel coefficient of the dynamic receiver, 
and hi = hiX2 is the equivalent channel coefficient. The 
dynamic receiver can determine the row space spanned by 
Xi even though hi is unknown, in a manner similar to 
Section Illl-AI The total DoF conveyed by xi is 1 (thus ^ 
per time-slot); this is the optimal DoF under the same number 
of antennas and coherence time. 

For the static receiver, the received signal is: 



y2 = h2X2[x[ 



1 



„(2) „„(2)i 



= h2 [Xi 



(1) ^(l)l 



„(2) .„(2)l 



(37) 
(38) 



where /12 is the channel coefficient of the static receiver, and 
h2 — h2X2- Intuitively, since ( |36] | and (|38] | are equivalent, 
if the dynamic receiver decodes the subspace of xi, so does 
the static receiver. Then, the exact signal vector xi is known 
to the static receiver (recall that each subspace is uniquely 
represented by a signal matrix). The static receiver removes 
the interference signal xi 



y2xl 



h2X2 + --^W2, 

Vp 



(39) 



where W2 is the equivalent noise. Finally, the static receiver 
knows h2, so it decodes X2 and attains 1/2 DoF per time-slot. 

Therefore, the proposed scheme attains the maximum DoF 
for the dynamic receiver, meanwhile achieving 1/2 DoF for 
the static receiver. With time sharing between this scheme and 
2?2, the achievable DoF pair is 



{di,d2) 



1 
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Fig. 5. DoF region of tlie toy example 2. 

Figure |5] shows that this region is uniformly larger than that 
of orthogonal transmission. 

Remark 4: There are two key differences between the 
method proposed here and the Grassmannian superposition 
proposed in Section |III] First, the information for the static 
receiver is carried by the value of X2 instead of its direction 
(subspace), i.e., the signal for the static receiver is carried in 
the Euclidean space. Second, the static receiver must decode 
and remove the interference signal for the dynamic receiver 
before decoding its own signal, which is unlike the non- 
interfering method of the previous section. 

B. Grassmannian-Euclidean Superposition Signaling 

We denote the aforementioned method as Grassmannian- 
Euclidean superposition, whose generalization is the subject 
of this subsection. Two separate cases are considered based 
on whether the number of static receiver antennas is less than, 
or no less than, the number of dynamic receiver antennas. 

1) Ni < N2: The transmitter sends X e C^^""^ 



X = 



T 



-X2X1, 



(41) 



N1N2 

where Xi e C^^''^ and X2 G C^^^^^ are signals for the 
dynamic receiver and the static receiver, respectively. The 
signal Xi is from a Grassmannian codebook Xi C G{T, Ni), 
while X2 is from a conventional Gaussian codebook A2. The 
constant ^T/NiN2 is a power normalizing factor 

We now give a sketch of the argument of the DoF attained 
by the superposition signaling ( |4TI ). For the dynamic receiver, 
Yi w H1X2X1 at high SNR. When A^i < N2, the equivalent 
channel H1X2 G C^'-^^^ has full rank and does not change 
the row space of Xi. Recovering the row space of Xi 
produces {T — Ni)Ni DoF, which is similar to Section HIH 

For the static receiver, the signal at high SNR is 



T 



-H2 X2 Xi 



T 



_______ -H2X1. (42) 

N1N2 V ^1^2 

For Ni < N2, H2 = H2X2 e C'^^xN^ has full column rank 

(40) 

and does not change the the row space of Xi, and therefore. 
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Fig. 6. DoF region (Theorem [2): A^i = 2, 7V2 = 4. 



the signal intended for the dynamic receiver can be decoded 
by the static receiver. From the subspace spanned by Xi, the 
codeword Xi G Xi is identified. Then, Xi is peeled off from 
the static signal: 



Y2XT 



T 



-H2X2 e C 



N2XN1 



(43) 



Because H2 is known by the static receiver, Eq. (|43] l is a 
point-to-point MIMO channel. Therefore, N2N1 DoF can be 
communicated via X2 to the static receiver (over T time- 
slots) [13]. 

Altogether, the Grassmannian-Euclidean superposition at- 
tains the DoF pair 



- iVi(l- A^i/T), N2N1/T 



(44) 



More precisely, we have the following theorem. 

Theorem 2 (Ni < N2): Consider a broadcast channel with 
an M -antenna transmitter, a dynamic receiver and a static re- 
ceiver with A^i and N2 antennas, respectively, with coherence 
time T for the dynamic channel. The Grassmannian-Euclidean 
superposition achieves the rate pair 



i?i =iVi(l- ^)logp + 0(l) 



R2 = 



log p + 0(1) 



Denote the corresponding DoF pair by . Together with the 
two single-user operating points 2?i, 2?2 obtained earlier, the 
achievable DoF region consists of the convex hull of X'2 
and 2?5. 

Proof: See Appendix Hill ■ 
With the Grassmannian-Euclidean superposition, the static 
receiver attains the following gain compared with orthogonal 
transmission: 



Ai?2 = ^logP + 0(l). 



(45) 



From Figure |6] for relatively small T or large N2, the DoF 
gain is significant. For example, at T = 2Ni, the minimum 
coherence interval considered in this paper, the proposed 
method achieves a DoF N2/2 for the static receiver while 



attaining the maximum DoF Ni/2 for the dynamic receiver. As 
T increases the gain over orthogonal transmission decreases. 
In the Hmit T — > 00, we have AR2 — 0(1), and the DoF 
gain of Grassmannian-EucUdean superposition goes away. The 
Grassmannian-Euclidean superposition also provides DoF gain 
over the non-interfering Grassmannian superpositioij^ 



AR=^ log 



p + 0{l). 



(46) 



The optimal design of the dimensions of Xi and X2 is 
trivial, because the DoF region in Theorem|2]is indeed optimal 
(see Section lIV-Db . 

2) Ni > N2: When the static receiver has fewer antennas 
than the dynamic receiver, it may not be able to decode the 
dynamic signal. Here, we cannot directly apply the signaling 
structure given by dTIT l. A straightforward way is to activate 
only N2 antennas at the transmitter and use only N2 dimen- 
sions for the dynamic receiver, that is 



X = 




N2XT 



(47) 



-'N2XN2 



and yjT/Nj is a 



N2{1 



(48) 



where Xi (i C'^^xi X2 G 
power normalizing factor 

Following the same argument as the case of iVi < N2, the 
Grassmannian-Euclidean superposition achieves the DoF pair 

^) M 

Corollary 4 (Ni > N2): Consider a broadcast channel with 
an M -antenna transmitter, a dynamic receiver and a static re- 
ceiver with A^i and N2 antennas, respectively, with coherence 
time T for the dynamic channel. The Grassmannian-Euclidean 
superposition achieves the rate pair 

f i?i =iV2(l- ^)logp + 0(l) 

I i?2 = ^l0gp+0(l) 



Denote the corresponding DoF pair with Vq. Together with the 
two single-user operating points Pi and P2 obtained earlier, 
the achievable DoF region consists of the convex hull of Vi, 
2?2 and Vq. 

Proof: The proof directly follows from Theorem |2] ■ 
In Corollary ID the static rate receiver is obtained at the 
expense of a reduction in the dynamic rate. The transmitter 
uses only N2 out of A^i dimensions available for the dynamic 
receiver, which allows extra DoF iV|/r for the static re- 
ceiver A necessary and sufficient condition for Grassmannian- 
Euclidean superposition to improve the DoF region is as 
follows. 

Corollary 5: For the Grassmannian-Euclidean superposi- 
tion, the signal dimension T = T, Ni = Ni and N2 = N2 
optimizes the rate region at high SNR. Moreover, it achieves 
superior DoF region compared with orthogonal transmission 
if and only if 

A^2 > (1 - (49) 

Although Grassmannian-Euclidean superposition achieves larger DoF than 
the non-interfering Grassmannian superposition, it may not achieve larger 
rate at low or moderate SNR due to the decodable restriction on the rate 
(interference). 
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Proof: First, using the maximum number of static 
antennas (N2 = N2) is optimal, because both i?i and R2 
in Corollary |4] are increasing functions of N2 (note that 
A^2 < T/2). 

Second, we find the optimal T. Maximizing the achievable 
DoF region is equivalent to maximizing the slope of the line 
between I?2 and I?4, i.e.. 



(0, N2) and {N2{1 



N2 

f t 



(50) 



which has a constant slope —1 and is independent of T. 
Therefore, any choice of T, as long as T > 2N2, achieves 
a boundary point of the DoF region of the Grassmannian- 
Euclidean superposition. 

Finally, for the Grassmannian-Euclidean superposition to be 
superior to orthogonal transmission in term of DoF, the slope 
of the line between I?2 and must be larger than the slope 
between Vi and I?2, namely 

N2 



(1 - Ni/T)Ni 



> 1. 



(51) 



This completes the proof. ■ 
Corollary|5]can be interpreted as follows: the Grassmannian- 
Euclidean superposition achieves superior DoF if and only if 
the maximum DoF of the static receiver is larger than that of 
the dynamic receiver. 

C. Design of X\ and X2 

We heuristically argue that it is reasonable to choose Xi to 
be isotropically distributed unitary matrices and X2 to be i.i.d. 
complex Gaussian codebook. . 

Recall that the Grassmannian-Euclidean superposition is to 
allow the static receiver to decode the signal for the dynamic 
receiver and then remove this interference. After interference 
cancellation, the static receiver has an equivalent point-to-point 
MIMO channel with perfect CSIR, in which case Gaussian 
signal achieves capacity. 

Assuming X2 G X2 has i.i.d. CJ\f{0, 1) entries, the equiv- 
alent channel for the dynamic receiver H1X2 is isotropically 



distributed (see Definition [T]), which leads to two properties. 
First, for any T x T unitary matrix 



p(Yi*|Xi*)=p(Yi|Xi). 
Second, for any iVi x Ni unitary matrix \& 
p(Yi|*Xi) =p(Yi|Xi). 



(52) 



(53) 



Based on these properties, the optimal signaling structure for 
the channel of the dynamic receiver is a diagonal matrix^ 
times a unitary matrix [8], [7]. Therefore, choosing Xi to 
be isotropically distributed unitary matrices is not far from 
optimal. 

D. Degrees of Freedom Region 

In this section, we show that the Grassmannian-EucUdean 
superposition achieves the optimal DoF region under certain 
channel conditions. 

1) Ni < N2: In this case, the optimal DoF region is as 
follows. 

Corollary 6 (Ni < N2): When an 7\f-antenna transmitter 
transmits to a dynamic receiver and a static receiver with 
iVi and antennas, respectively, with the dynamic channel 
coherence time T, the DoF region is: 



di <iVi(l-^) 

di I d2_ ^ 1 



(54) 



Proof: An outer bound can be found when both receivers 
have CSIR. The DoF region of the coherent upper bound is [2] 



di d2 

Ni N2 



< 1. 



(55) 



An inner bound is attained by Grassmannian-Euclidean su- 
perposition, which reaches the boundary of dSSb except for 
di > Ni{l - Ni/T). However, the DoF of the dynamic 
receiver can never exceed iVi(l — Ni/T) (see Section Hlb . 
Therefore, Grassmannian-Euclidean superposition achieves the 
DoF region. ■ 
2) Ni > N2: In this case, the Grassmannian-Euclidean 
superposition does not match the coherent outer bound dSSl ), 
however, we can partially characterize the DoF region for 
broadcasting with degraded message sets [6] and in the case 
of the more capable channel [5]. For both cases the capacity 
region is characterized by: 

i?i </(U;Yi) 

R1+R2 </(X2;Y2|U)+/(U;Yi) , (56) 
R1+R2 </(X2;Y2) 

where U is an auxiliary random variable. From the last 
inequality we have 



that is 



R1+R2 < N2logp + 0{l), 



di+d2< N2 



(57) 



(58) 



'when the channel is i.i.d. Rayleigh fading this diagonal matrix should be 
identity at high SNR [7]. However, it remains unknown whether the optimal 
choice is an identity matrix at arbitrary SNR. 
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When 7V2 > (1 - Ni/T)Ni, the inner bound in Corollary H 
coincides with the outer bound ( |58] l for < di < N2{1 — 
N2/T), therefore, the DoF is established for this range. For 
di > N2{1~N2/T), the inner and outer bounds do not match, 
but the gap is small when 7V2 is close to A^i. 

When N2 <{1- Ni/T)Ni, the inner bound in Corollary g] 
is inferior to orthogonal transmission and the problem remains 
open. 

V. Conclusion 

Signal superposition based on a multiplicative structure was 
proposed to improve the degrees of freedome of the MIMO 
broadcast channels when one receiver has full CSIR while the 
other receiver has no CSIR. Two superposition signaling meth- 
ods were proposed, both based on product superposition. In the 
Grassmannian superposition, the transmit signal is a product 
of two Grassmannian codewords, producing higher DoF than 
orthogonal transmission while reception is still interference- 
free at both receivers. The Grassmannian-Euclidean superpo- 
sition uses coherent signaling for the receiver with CSIR, and 
Grassmannian signaling for the receiver without CSIR. The 
latter method is shown to attain the optimal DoF region under 
a broad set of channel conditions. 

It is possible to extend the results of this paper to more 
than two receivers. The set of receivers can be divided into 
two sets, one with and one without CSIR. At each point in 
time, the transmitter uses product superposition to broadcast 
to two users, one from each group. A scheduler selects the 
pair of users that is serviced at each time. The time-sharing 
parameters defining the overall rate region are as follows: 
one parameter determines how long a given pair is serviced 
(time sharing between pairs) and for each pair a parameter 
determines the operating point of the degree-of-freedom region 
of that pair To facilitate the case where there are unequal 
number of dynamic and static users, the pair memberships 
are allowed to be non-unique, i.e., there may be two or more 
pairs that contain a given receiver The overall rate region is 
the convex hull of all rate vectors corresponding to all values 
of the time-sharing parameters mentioned above. 

Appendix I 
Proof of Theorem[T] 

A. Achievable Rate for the Dynamic Receiver 

The normalized received signal Yi e at the dynamic 

receiver is 



Yi = 



■ H1X2X1 



1 



VP 



1, 



where Hi G C^^^^^ is the dynamic channel, Xi e C'^^xt 
and X2 S QN2XN1 jjjg isotropically distributed, unitary 
signals for the dynamic and static receivers, respectively, and 
Wi e QNiy.T additive Gaussian noise. 

Let Hi = H1X2 be the A^i x A^i equivalent channel, and 
rewrite (|59]l as 



Yi 



HiXi 



1 



Wi. 



(60) 



The elements in Hi are 

N2 



[Hi]ij = ^ hikXkj, I < i,j < Ni, 



(61) 



k=l 



where h.k = [HiJ^fe and Xkj = [X-2]kr Note that hik is i.i.d. 
random variable with zero mean and unit variance, therefore. 







For = (171,11) we have 



E 

k=l 



^ im,n) 



E[\h,k\^\xkj\' 



N2 
fc=l 



= 1, 



(62) 



(63) 



(64) 



where ( |64] | holds because E[|/iifep] = 1 and each column of 
X2 has unit norm. Therefore, the equivalent channel Hi has 
uncorrected entries with zero mean and unit variance. 
We now find a lower bound for the mutual information 



/(Xi;Yi) = ;i(Yi)-/i(Yi|Xi), 



(65) 



i.e., an achievable rate for the dynamic receiver First, we find 
an upper bound for /i(Yi|Xi). Let yi^ be the row i of Yi. 
Using the independence bound on entropy: 



;i(Yi|Xi)<}_^/i(yi,|Xi). 



(66) 



Let hi be the row i of Hi. Then, conditioned on Xi the 
covariance of yu is 



E[yI,yi,|Xi] = ^XlE[h|h,] Xi 

iVi 



1 



(67) 
(68) 



where the last equality holds since all the elements in Hi are 
uncorrected with zero mean and unit variance. In addition, 
given Xi, the vector yii has zero mean, and therefore, 
/i(yij|Xi) is upper bounded by the differential entropy of 
a multivariate normal random vector with the same covari- 
ance [4]: 



h{yu\Xi) < logdet(^xlXi + ^1 



< Ni log ( 



(59) Combining 



T 1. 
and ( iTOl i, we obtain 



(69) 

(r-iVi)iogp. (70) 



T 



1. 



/i(Yi|Xi) < Ni log (-^ + -) - NiiT - TVi) log p. (71) 
^Ni p' 

After calculating the upper bound for /i(Yi|Xi), we now 
find a lower bound for h{Yi) as follows. 



(72) 
(73) 



MYi)>/i(\/— H1X2X1] 



> H1X2X1 |Hi,X2) 
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where ( 172] ) holds since we remove the noise, and (|73]l holds 
since conditioning does not increase differential entropy. The 
Jacobian from Xi to H1X2X1 is [15, Theorem. 2.1.5]: 



J 



Xi 



T 



-det(HiX2 



N1N2 

Therefore, from (|73] | we have 

MYi) > MXi)+E[logJxJ, 



(74) 



(75) 



where the expectation is with respect to X2 and Hi. Because 
Xi is an isotropically distributed unitary matrix, i.e., uniformly 
distributed on the Stiefel manifold F(T, iVi), we have [7] 



h{Xi)^log\¥{T,Ni 



(76) 



where |F(T, A^i)| is the volume of F(T,iVi) based on the 
Haar measure induced by the Lebesgue measure restricted to 
the Stiefel manifold [15]: 



|F(r,7Vi)| 



n 



Finally, combining ( fTTT i and dTST l. we obtain 

/(Xi;Yi) > 7Vi(r-iVi)logp + log|F(r,Ari)| 

+ E[\ogJx,]-N,yjog{— + -) 



(77) 



1=1 



A^i(r-iVi)iogp + o(i). 



(78) 



(79) 



Normalizing /(Xi; Yi) over T time-slots yields the achiev- 
able rate of the dynamic receiver. 

B. Achievable Rate for the Static Receiver 
The signal received at the static receiver is 



^H2X2Xi + ^W2, 

^1 ^^P 



(80) 



where H2 £ C^^'^^" is the static channel and W2 G C^"""^ 
is additive Gaussian noise. Denote the sub-matrix containing 
the first Ni columns of Y2 with Yj. 



(81) 



where X'j^ G q^i-xNi j-j^g corresponding sub-matrix of Xi, 
and W2 e qNixNi j Gaussian noise. Given H2, the 
mutual information between Y2 and X2 is lower bounded by: 



J(Y2;X2|H2)>/(Yi;X2|H2). 



(82) 



We will focus on /(Y2;X2|H2) to derive a lower bound. 
Using the singular value decomposition (SVD): 



H2 = U^S2V2, 



(83) 



where U2, V2 G C^^^^^ and S2 = diag(Ai, • • • , AjvJ with 
I All > • • • > IAatjI- Since H2 is known and non-singular, the 
dynamic receiver applies 'H^'^ to remove it: 



Ho """Yn 



— X2X' + — w 

^1 ' VP 



The columns of are mutually independent, and each 
column has an autocorrelation: 



(85) 



Because mutual information is independent of the choice of 
coordinates, we have 



/(Y^;X2|H2) =/(H2-iY^;X2|H2) 



(86) 



= /^(H2-iY^|H2)-MH2-'Y^|X2,H2). 

(87) 

Let y2,i be the column i of Hj"^ Y2, then via the independence 
bound on entropy: 

/i(H2-iYi|X2,H2) < ^/i(y2,.|X2,H2). (88) 



From (|84] | and (l85l l, the autocorrelation of y2,i conditioned on 
X2 and H2 is 



R2,» - ^X2E[x'i,,x'/jxt + Ir^ 

iVi p 

T 1 

= -r--X2Rl iX2 H RvF 

iVi p 



(89) 
(90) 



where x'j^ j G C^^ ^ ^ is the column i of X'j^ and has auto- 
correlation Ri,i. The expected value of y2,i is zero and thus 
the differential entropy is maximized if y2,i has multivariate 
normal distribution [4]: 

/i(y2,,|X2,H2) < logdet(-^X2Ri^.Xt + Ir,^) 

= logdet(^V2X2Ri,,X^VJ + is^-^). 

(91) 

The following lemma calculates Ri_i, the autocorrelation of 
a column of an i.d. matrix. 

Lemma 1: If Q G C^^^ is isotropically distributed (i.d.) 
unitary matrix, then each row and column of Q is an i.d. unit 
vector with autocorrelation ^It- 

Proof: From Definition [T] given Q is i.d., for any 
deterministic unitary matrix ^> G C^^'^, 



p(Q*) =p(Q), 



(92) 



which implies that the marginal distribution of each row and 
column remains unchanged under any transform Therefore, 
each row and column is an i.d. unit vector Without loss of 
generality, we consider the first row of Q, denoted as qi. Let 
the autocorrelation of qi be R^ and posses the eigenvalue 
decomposition R, = P^S^P, where P G C"^" is unitary 
and Sg is diagonal. Since qiP^ has the same distribution as 
qi, therefore 



R, =E[qlqi] =PE[qIqi]P 



(93) 



Thus Rq is a diagonal matrix. Furthermore, the diagonal 
elements of have to be identical, i.e., = al-r, otherwise 
Rg would not be rotationally invariant which conflicts with 
(84) the i.d. assumption. Finally, because tr(Rq) = 1, we have 
Hq — It /T. This completes the proof of Lemma [T] ■ 
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Since Xi is an i.d. unitary matrix, based on Lemma [T] the 
autocorrelation of its sub-column is 



R-i,j — Ini/T. 



Therefore, the eigenvalues of V2X2Ri,iX2V2 are 
1 1 



T' 



T 



Ni 



N2-N1 



(94) 



(95) 



We now bound the eigenvalues of the sum of two matrices 
in ( |9T] i. noting that are in ascending order and using a 
theorem of Weyl [16, Theorem 4.3.1]: 



My2,.|X2,H2)<7Vilog(^ + Ajv2 



Appendix II 
Proof of Corollary [T] 

The objective is to find the best dimensions for the transmit 
signals Xi e C^i^* and X2 G C^2xN^^ p^^^ Theorem [T] 
it is easily determined that N2 = N2 is optimal, because the 
pre-log factor of R2 increases with N2 and the pre-log factor 
of Ri is independent of N2 (given Ni < N2). 

To find the optimal values of Ni , T, we start by relaxing 
the variables by allowing them to be continuous valued, i.e. 
A^i — > X and T ^ y, and then showing via the derivatives 
that the cost functions are monotonic, therefore optimal values 
reside at the boundaries, which are indeed integers. 

Using the DoF expression from Theorem [T] the slope 
between two achievable points I?2 and 2?3 is: 

x{N2- x)/y- N2 



+ {N2-N,)log-Xj?. 



(96) 



From 



and (1961 1, we have: 



f{x,y) 

Therefore, for all < a; < iVi, 
df{x,y) _ 



x{l - x/y) 



h{H^'Y'2\X2, H2) < log (— + A^2) 

-NiiN2~Ni) log X]^l~NiiN2-Ni)\ogp. (97) 
We now calculate a lower bound for ft,(H2"^Y2|H2): 

(98) 



> 0. 



(106) 



(107) 



dy {y - xY 

We wish to maximize / with the constraint y <T, thus y = T 
is optimal. 

Substituting y = T into f(x,y), we have 



hiU-'Y'2\U2) > h{\l —X2X[\U2 



df{x,T) 
dx 



{T^ N2)x^ +TN2X-T^N2 
x^{T — xY 



(108) 



>M\/^x2x;|x;,H2). (99) 



If T = N2, since x < T/2, then f| > 0. In this case 
X = Ni maximizes the DoF region. 

If T 7^ N2, let T = aN2. When < a < |, one can verify 



From [15, Theorem. 2.1.5], given X; the Jacobian of the f^at |^ > for all a; > 0. Thus, :r = A^i is optimal. When 



transformation from X2 to ^/ jj-X.2X.[ is: 



a > 4, let 



^2 



det(X'i)^^ 



Therefore, from the right hand side of ( |99] l we have 
/i(H2-^Yi|H2) > /i(X2) +E[log JxJ, 



(100) 



(101) 



9/ 
dx 



Xl,2 



0, and we have the corresponding solutions: 



'aN2 ± aN2 \/l + 4(a - 1) 



(109) 



When I < a < 1, the above solutions are positive, where the 



smaller one is: 



where the expectation is with respect to X'^^. Because X2 is 
uniformly distributed on the Stiefel manifold ¥{N2, Ni), we 
have [7] 

/i(Xi) = log|F(iV2,A^i)|, (102) 

where |F(iV2, is the volume of F(A^2, -^1), which is given 
by [15]: 



Xl 



aN2 - aN2 ^/T^^^T^^ 
2(1 - a) 



(110) 



Since §^ > at x = 0, we have || > for < x < iVi. 
When a > 1, the (only) positive solution of ( |109t is: 

aN2 + aN2 ^/lT^{a^^ 



Xl 



2(«-l) 



> Ni. 



(Ill) 



Once again, since ^ > at a; = 0, we have > for 
{)<x<Ni. 

Therefore, for all cases, x — Ni maximizes the DoF region. 

Appendix III 
Proof of Theorem[2] 

A. Achievable Rate for the Dynamic Receiver 

The proof is similar to the proof for Theorem [T] so we only 
outline key steps. The received signal at the dynamic receiver 
is 

1 



dl 



W{N2,Ni)\ 



TV, 

n 



27r' 



(103) 



i=N2-Ni + l ^ ' 

Finally, substituting ( llOlb and Wt\ into dSTI i. we have 

/(Y^;X2|H2) = iVi(7V2-A^i)logp + 0(l). (104) 
Hence, the rate achieved by the static receiver is 

1 N^ 

-E[/(Y^; X2IH2)] = -^(N2 - N,) logp + 0(1), (105) 
where the expectation is with respect to H2. 



Yi 



T 



NiN: 



■H1X2X1 



Wi, 



(112) 
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where Yi e C^^^^ and Hi e C^^^^^ and Wi e C^^^^ is 
additive Gaussian noise. We establish a lower bound for the 
mutual information between Xi and Yi: 

/(Xi;Yi) = /i(Yi)-/i(Yi|Xi). (113) 

In the above equation, we have 



/i(YilXi) <yhiyu\Xi) 



i=l 



One can verify 



hiyu\Xi) < logdet(^xlXi + ^l). (114) 
Finally, we obtain 
MYilXi) < iVi'log(^ + ^) - iVi(T-iVi)logp. (115) 
The lower bound is given by: 

h(Yi) > log\¥{T,Ni)\+E[\ogJx,l (116) 
where the expectation is with respect to Hi and X2, and 



T 



-det(HiX2 



Ni 



N1N2 

Combining ( II 151 ) and dl 171 ). and normalizing over T time 
slots leads to the achievable rate of the dynamic receiver 

B. Achievable Rate for the Static Receiver 

The received signal at the static receiver is Y2 G C^^y-T 



Conditioned on H2, the decodable dynamic rate at the static 
receiver is 

/(Xi;Y2|H2)=/(Xi;Yi|H2), 
which is lower bounded by 

/(Xi;Yi'|H2) =/i(Y^'|H2)-/i(Y;,'|Xi,H2). (122) 
Using the independence bound for ft.(Y2|Xi,H2) yields 

/i(Y^'|Xi,H2) < ^My2*|Xi,H2), (123) 

4=1 

where y2,; is the row i of Yj. Let X2i be the row i of Xj, 
for 1 < i < iVi. Since X^' € C^^^^i have i.i.d. C7V(0, 1) 
entries, all the row vectors X2i have the same autocorrelation 
Ini- 

Conditioned on Xi, the autocorrelation of y2i — AiX2iXi 
is given by 

E[yt^y2.|Xi,H2] = A.^xjXi + -It. (124) 

Therefore, 

^^^^^ My2.|Xi,H2) <logdet(A2xtXi + ilT), (125) 



Y2 = 



-^H2X2Xi + ^W2, 

N1N2 ^ 



where H2 G C^^^^^ is the static channel, and W2 G C^i^^ 
is additive Gaussian noise. 

We first calculate the decodable dynamic rate at the static 
receiver in the next lemma. 

Lemma 2: The static receiver is able to decode the dynamic 
rate _Ri if 

i?i < iVi(l-iVi/T)logp + 0(l). (118) 
Proof: Use the SVD for H2 and re-write the signal at 
the static receiver as 



Y, = 



-^U^S2V2X2Xi + ^W2, (119) 

Because X2 is an isotropically distributed unitary matrix, 
X2 = V2X2 has the same distribution as X2, i.e., a matrix 
of i.i.d. C7V(0, 1). Rotate Y2 with U2 



^ U2Y2 = 



S2X'2Xi + — W;, (120) 

where W2 is i.i.d. Gaussian noise. Let Yj G Q^i^T j-j^g 
first A^i rows of Y2, i.e., the rows corresponding to the largest 
A^i singular modes of H2, that is |Ai| > ••• > IAatJ. We 
denote the corresponding A^i x A^i sub-matrix of X2 by Xj. 
Then, 

Y^' -diag(Ai,--- ,Aa,JX'2'Xi + ^W^'. (121) 



iVi log (A^ + -) - (T - iVi) log p. (126) 



and subsequently. 



MY^'|Xi,H2) < ^log(A,? + -)-iVi(T-iVi)logp. (127) 



We now find a lower bound for /i(Y2 IH2). Similar to ( IIOII ). 
we have 

;i(Y^'|H2) > /i(Xi)+E[JxJ, (128) 



where the expectation is with respect to X2, and 

T 

MXi)-iF(T,7Vi)i= n 



i=T-N 



1 + 1 ^ ' 



and 



AT, 



(129) 



(130) 



1=1 



Finally, taking expectation over H2, we obtain 

E[/(Xi;Y2|H2)] > 7V,(r-iVi)logp + MXi)+E[JxJ 

-E[5:iog(A? + i)] 

= 7Vi(T-iVi)logp + 0(l). (131) 

This completes the proof for Lemma |2l ■ 
Therefore, the transmitter is able to send Ni{\ — Ni/T) DoF 
to the dynamic receiver, while ensuring the dynamic signal is 
decoded at the static receiver 

After decoding Xi, the static receiver removes the interfer- 
ence: 



Y2Xl 



-^H2X2 + ^Wi, 
^1^2 ^/P 



(132) 
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where € qn^xNi equivalent noise whose entries are 
stiU i.i.d. CN{Q, 1). The equivalent channel for the static re- 
ceiver is now a point-to-point MIMO channel. With Gaussian 
input X2, we have [13] 

/(X2;Y2|H2) = 7Vi7V2logp + 0(l). (133) 

Normahzing /(X2;Y2|H2) over T time-slots yields the 
achievable rate of the static receiver. 
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